feat: streamline benchmark quick/all execution flow by outbounder · Pull Request #9 · camplight/knowledgeplane

outbounder · 2026-04-17T07:31:21Z

Summary

Add root-level benchmark workflows (bench:quick, bench:all, bench:quick:norerank), driven by scripts/bench-with-stack.sh, that start the required services, warm benchmark credentials, and run the benchmark suites end-to-end.
Improve benchmark resilience and runtime behavior by supporting reranker soft-skip/strict modes, scaling the freshness FAISS baseline defaults for quick runs, and adding DB warm-up and credential sync via scripts/bench-warm-db.ts.
Fix repeated benchmark auth and cache issues by loading .env.benchmark in benchmark compose, persisting /root/.cache for HF/model assets, failing fast on 401/403 in the adapter, and updating the benchmark docs and SPEC accordingly.
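The fail-fast auth handling and the reranker modes described above could look roughly like the sketch below. This is not the adapter's actual code, which the PR description does not show. The function names and the BENCH_RERANKER_STRICT variable are hypothetical, and the meaning given to "soft-skip" (warn and continue) versus "strict" (abort) is an assumed reading. Only BENCH_SKIP_RERANKER=1 comes from the test plan.

```typescript
// Sketch only: all function names here are hypothetical, not taken from the repo.

// Fail fast: a 401/403 means the benchmark credentials are wrong, so retrying
// cannot help. Throw immediately instead of continuing the run.
function assertAuthorized(status: number): void {
  if (status === 401 || status === 403) {
    throw new Error(
      `benchmark request rejected with HTTP ${status}; check the credentials in .env.benchmark`
    );
  }
}

type RerankerMode = "soft-skip" | "strict";

// BENCH_SKIP_RERANKER=1 appears in the PR's test plan.
function shouldRunReranker(env: Record<string, string | undefined>): boolean {
  return env["BENCH_SKIP_RERANKER"] !== "1";
}

// BENCH_RERANKER_STRICT is an assumed variable name used for illustration.
function resolveRerankerMode(env: Record<string, string | undefined>): RerankerMode {
  return env["BENCH_RERANKER_STRICT"] === "1" ? "strict" : "soft-skip";
}

// Assumed semantics: strict aborts the run, soft-skip logs a warning and continues.
function onRerankerUnavailable(mode: RerankerMode, reason: string): "skipped" {
  if (mode === "strict") {
    throw new Error(`reranker is required but unavailable: ${reason}`);
  }
  console.warn(`reranker unavailable (${reason}); skipping rerank stage`);
  return "skipped";
}
```

Keeping the mode decision in a small pure function makes it easy to check without starting the benchmark stack.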

Test plan

Run BENCH_SKIP_RERANKER=1 npm run bench:quick and verify successful completion (EXIT_CODE=0)
Confirm benchmark archives are created under tests/benchmarks/runs/ for freshness/hotpot/msmarco quick runs
Re-run BENCH_SKIP_RERANKER=1 npm run bench:quick and confirm faster second-run execution with cache in place
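The archive check in the test plan can be automated with a small helper like the one sketched here. It assumes that each archive name under tests/benchmarks/runs/ contains its suite name. The PR does not state the naming scheme, so that is a guess, and missingSuites is a hypothetical helper.

```typescript
// Suites named in the test plan.
const QUICK_SUITES: readonly string[] = ["freshness", "hotpot", "msmarco"];

// Return the suites for which no entry name contains the suite name.
// Assumption: archive file or directory names embed the suite name.
function missingSuites(entries: readonly string[], suites: readonly string[]): string[] {
  return suites.filter(
    (suite) => !entries.some((name) => name.toLowerCase().includes(suite))
  );
}
```

In a Node script the entries could come from fs.readdirSync("tests/benchmarks/runs"). An empty result means every quick suite produced at least one archive.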

Made with Cursor

outbounder added 2 commits April 17, 2026 10:26

feat: streamline benchmark quick/all execution flow

d2fd8bc

Add root-level bench commands that start required services, warm benchmark credentials, and support optional reranker/FAISS skips while fixing benchmark container env/cache behavior and auth failure handling. Made-with: Cursor

fix: resolve CI lint and typecheck regressions

23414b4

Make OpenAI/db type usage compatible across SDK and TS lib variants, and scope root lint to the configured workspace so CI lint/typecheck checks pass reliably. Made-with: Cursor

outbounder merged commit e5a6fc1 into main Apr 17, 2026
3 checks passed

outbounder merged 2 commits into main from 88-make-benchmarks-run-quickly-and-be-easy-to-execute
